Some Proposals towards a More Rigorous Corpus
نویسنده
چکیده
Over the past few decades, corpus linguistics has evolved into a fully-fledged methodological approach with an increasing number of scholars using various different methods. In this rather programmatic paper, I will argue, however, that corpus linguistics has, in some respects at least, still some way to go in terms of developing rigorous tools and methods and using them more often. More specifically, corpus linguistics – as the young discipline it still is – still has much to learn from other disciplines; a prime candidate in this respect is psycholinguistics. I will try to support this claim with arguments from several case studies.
منابع مشابه
Vocabulary Lists for EAP and Conversation Students
Despite the abundance of research investigating general and academic vocabularies and developing dozens of word lists, few studies have compared academic vocabulary with general service word lists such as conversation vocabulary. Many EAP researchers assume that university students need to know all the words in West’s (1953) General Service List (GSL) as a prerequisite to academic words (e.g., ...
متن کاملTowards a Corpus Annotated for Metonymies: the Case of Location Names
At the moment, language resources do not contain the necessary information for large-scale metonymy processing. As a contribution, we here present a corpus annotated for metonymies. We describe a framework for annotating metonymies in domain-independent text that considers the regularity, productivity and underspecification of metonymic usage. We then present a fully worked out annotation schem...
متن کاملA closer look at creativity as search
Several papers by Wiggins (building on ideas by Boden) have outlined a view of creative concept generation as a very general search process, but that formalisation has not been developed much in the past few years. Also, there are some aspects where clarification or spelling out of details would be useful. We present a re-formulation of the central ideas in Wiggins’s framework, with slightly mo...
متن کاملInteroperability of text corpus annotations with the semantic web
This paper explores the adaptation of the PubAnnotation model with recent more general proposals for the representation of annotations in the Semantic Web, referred to here as the Open Annotation model and the focus of the W3C Web Annotation Working Group. We argue that interoperability with standards under development for text annotation on the web, and with recent proposals related to nanopub...
متن کاملThe MATE/GNOME Proposals for Anaphoric Annotation, Revisited
In the five years since it was proposed, the MATE scheme for anaphoric annotation has been used in a variety of annotation projects, and the resulting corpora have been used to study both anaphora resolution and NL generation. Annotation tools inspired by the proposals have been used in some of these projects. In this paper we discuss these first experiences with the scheme, some lessons that h...
متن کامل